Search CORE

376 research outputs found

R-Gada: a fast and flexible pipeline for copy number analysis in association studies

Author: A Caceres
AB Olshen
AJ Iafrate
AL Price
Alejandro Cáceres
DF Conrad
G Perry
H Willenbrock
JM Kidd
Juan R González
K Wang
L Winchester
ME Tipping
MJ Greenacre
R Pique-Regi
R Redon
Roger Pique-Regi
S McCarroll
TA Manolio
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Genome-wide association studies (GWAS) using Copy Number Variation (CNV) are becoming a central focus of genetic research. CNVs have successfully provided target genome regions for some disease conditions where simple genetic variation (i.e., SNPs) has previously failed to provide a clear association. Results Here we present a new R package, that integrates: (i) data import from most common formats of Affymetrix, Illumina and aCGH arrays; (ii) a fast and accurate segmentation algorithm to call CNVs based on Genome Alteration Detection Analysis (GADA); and (iii) functions for displaying and exporting the Copy Number calls, identification of recurrent CNVs, multivariate analysis of population structure, and tools for performing association studies. Using a large dataset containing 270 HapMap individuals (Affymetrix Human SNP Array 6.0 Sample Dataset) we demonstrate a flexible pipeline implemented with the package. It requires less than one minute per sample (3 million probe arrays) on a single core computer, and provides a flexible parallelization for very large datasets. Case-control data were generated from the HapMap dataset to demonstrate a GWAS analysis. Conclusions The package provides the tools for creating a complete integrated pipeline from data normalization to statistical association. It can effciently handle a massive volume of data consisting of millions of genetic markers and hundreds or thousands of samples with very accurate results.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Copy-number variation in BMPR2 is not associated with the pathogenesis of pulmonary arterial hypertension

Author: AJ Iafrate
Cindy L Vnencak-Jones
DF Conrad
EH Cook Jr
GH Perry
GM Cooper
James E Loyd
James West
Jennifer A Johnson
JH Newman
JM Kidd
Joy D Cogan
KB Lane
R Hamid
RK Rowntree
SA McCarroll
SA McCarroll
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Copy-number variations (CNVs) are structural variations in the genome involving 1 kb to 3 mb of DNA. CNV has been reported within intron 1 of the <it>BMPR2 </it>gene. We propose that CNV could affect phenotype in familial and/or sporadic pulmonary arterial hypertension (PAH) by altering gene expression. Methods 97 human DNA samples were obtained which included 24 patients with familial PAH, 18 obligate carriers (<it>BMPR2 </it>mutation positive), 20 sporadic PAH patients, and 35 controls. Two sets of primers were designed within the CNV, and two sets of control primers were designed outside the CNV. Quantitative PCR was performed to quantify genomic copies of CNV and control sequences. Results A CNV in <it>BMPR2 </it>was present in one African American negative control subject. Conclusion We conclude that the CNV in intron 1 in <it>BMPR2 </it>is unlikely to play a role in the pathogenesis of either familial or sporadic PAH. Trial Registration NIH NCT00091546.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Homozygous microdeletion of exon 5 in ZNF277 in a girl with specific language impairment

Author: A Gregor
AJ Iafrate
AJ Whitehouse
AJ Whitehouse
Alistair T Pagnamenta
Ann Clark
AT Pagnamenta
AV Molofsky
AV Molofsky
B Bakkaloglu
BJ O'Roak
BP Coe
C Palles
Clyde Francks
CS Leblond
D Malhotra
David R Bentley
DF Newbury
DF Newbury
Dianne F Newbury
E Maestrini
EH Cook Jr
Elena Bacchelli
Elena Maestrini
Elizabeth R Hennessy
Fabiola Ceroni
G Conti-Ramsden
GD Schellenberg
Gillian Baird
Gina Conti-Ramsden
H Liang
Hilary Martin
HJ Kang
International Molecular Genetic Study of Autism Consortium
International Molecular Genetic Study of Autism Consortium
J Law
J Wincent
J Zhang
Jeremy Parr
JT Glessner
K Darvishi
K Stromswold
K Wang
KA Lindgren
KJ Livak
L Mawhood
LA Weiss
LE Vissers
M Bucan
M Falcaro
M Khajavi
M Negishi
MM Kjelgaard
NG Riches
Nuala H Simpson
P Howlin
Patrick F Bolton
Peter Donnelly
S Barrett
S Berkel
S Colella
S Girirajan
S Girirajan
S Girirajan
SC Vernes
SE McCarthy
Simon E Fisher
SJ Sanders
The SLI Consortium
The SLI consortium (SLIC)
Y Shao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Peer reviewedPublisher PD

Aberdeen University Research

Crossref

PubMed Central

Oxford University Research Archive

Radboud Repository

King's Research Portal

Queen Margaret University eResearch

MPG.PuRe

Extensive Copy-Number Variation of Young Genes across Stickleback Populations

Author: A Abyzov
A Alexa
A Conesa
A Hussain
AJ Iafrate
AJ Sharp
AJ Vilella
AR Boyko
AR Quinlan
B Guo
BE Deagle
C Eizaguirre
C Eizaguirre
Christophe Eizaguirre
CL McGrath
CL Peichel
D Bryant
D Juan
D Tautz
DE Cook
DH Huson
DJ Turner
DR Schrider
DR Schrider
DR Zerbino
E Gazave
E Proux
Erich Bornberg-Bauer
FA Kondrashov
FC Jones
Frédéric J. J. Chain
G Gibson
G Orti
GC Conant
GH Perry
GH Perry
GM Cooper
H Kehrer-Sawatzki
H Li
Irene E. Samonte
J Sebat
JA Fawcett
Jianzhi Zhang
JJ Emerson
JK Colbourne
JO Korbel
JO Korbel
K Chen
K Khalturin
K Ye
KJ Lipinski
KJ Livak
KM Teshima
KM Wegner
L Xu
LC Hsing
LR Saraiva
M Hiraiwa
M Long
M Long
M Lynch
M Lynch
M Milinski
M Roesti
MA DePristo
Mahesh Panchal
Manfred Milinski
Martin Kalbe
Monika Stoll
N Ghanem
P Danecek
P Flicek
P Sjödin
PA Hohenlohe
PGD Feulner
PH Sudmant
Philine G. D. Feulner
PM Kim
R Redon
RC Iskow
S Moretti
S Sawyer
SF Altschul
SH Williamson
SM Waszak
SR Browning
T Marques-Bonet
T Rausch
TD Schmittgen
Thorsten B. H. Reusch
Tobias L. Lenz
V Guryev
V Katju
V Katju
V Ranwez
X Huang
Y Hashiguchi
Y Hashiguchi
Y Zheng
YE Zhang
YF Chan
Z Yang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2014
Field of study

MM received funding from the Max Planck innovation funds for this project. PGDF was supported by a Marie Curie European Reintegration Grant (proposal nr 270891). CE was supported by German Science Foundation grants (DFG, EI 841/4-1 and EI 841/6-1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript

OceanRep

Crossref

Directory of Open Access Journals

PubMed Central

Queen Mary Research Online

Bern Open Repository and Information System (BORIS)

MPG.PuRe

FigShare

Quantitative Analysis of Single Nucleotide Polymorphisms within Copy Number Variation

Author: AJ Iafrate
AJ Jeffreys
AJ Sharp
BS Weir
Charles R. Cantor
D Fredman
DG Cox
DJ Hunter
DQ Nguyen
E Tuzun
G Zogopoulos
GH Hardy
HW Deng
I Halder
J Ragoussis
J Sebat
J Shoemaker
JA Bailey
JL Hernandez
JM Olson
JO Korbel
KK Wong
L Hosking
NR Council
R Redon
R Sachidanandam
Richard Mayeux
S Colella
S Levy
S Rozen
SA McCarroll
Simon Kasif
SM Leal
Soohyun Lee
TA Trikalinos
TH Emigh
W Weinberg
Zhiping Weng
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

BACKGROUND: Single nucleotide polymorphisms (SNPs) have been used extensively in genetics and epidemiology studies. Traditionally, SNPs that did not pass the Hardy-Weinberg equilibrium (HWE) test were excluded from these analyses. Many investigators have addressed possible causes for departure from HWE, including genotyping errors, population admixture and segmental duplication. Recent large-scale surveys have revealed abundant structural variations in the human genome, including copy number variations (CNVs). This suggests that a significant number of SNPs must be within these regions, which may cause deviation from HWE. RESULTS: We performed a Bayesian analysis on the potential effect of copy number variation, segmental duplication and genotyping errors on the behavior of SNPs. Our results suggest that copy number variation is a major factor of HWE violation for SNPs with a small minor allele frequency, when the sample size is large and the genotyping error rate is 0~1%. CONCLUSIONS: Our study provides the posterior probability that a SNP falls in a CNV or a segmental duplication, given the observed allele frequency of the SNP, sample size and the significance level of HWE testing

Public Library of Science (PLOS)

Crossref

Boston University Institutional Repository (OpenBU)

Directory of Open Access Journals

PubMed Central

eScholarship@UMMS

Structural Alterations from Multiple Displacement Amplification of a Human Genome Revealed by Mate-Pair Sequencing

Author: AJ Iafrate
C Tanabe
CA Klein
Christian Tellgren-Roth
FB Dean
H Telenius
Jonathan Mangion
JR Nelson
Jörg D. Hoheisel
KJ McKernan
L Lovmar
L Zhang
Liqun He
Magnus Rosenlund
PJ Campbell
PJ Stephens
RS Lasken
S Volik
Sean D. Hooper
T Sjöblom
Tobias Sjöblom
Xiang Jiao
Yutao Fu
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Comprehensive identification of the acquired mutations that cause common cancers will require genomic analyses of large sets of tumor samples. Typically, the tissue material available from tumor specimens is limited, which creates a demand for accurate template amplification. We therefore evaluated whether phi29-mediated whole genome amplification introduces false positive structural mutations by massive mate-pair sequencing of a normal human genome before and after such amplification. Multiple displacement amplification led to a decrease in clone coverage and an increase by two orders of magnitude in the prevalence of inversions, but did not increase the prevalence of translocations. While multiple strand displacement amplification may find uses in translocation analyses, it is likely that alternative amplification strategies need to be developed to meet the demands of cancer genomics

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

Publikationer från Uppsala Universitet

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

PanSNPdb: The Pan-Asian SNP Genotyping Database

Author: AJ Iafrate
Amanda Ewart Toland
Anunchai Assawamakin
C International HapMap
Chumpol Ngamphiw
D Komura
E Pennisi
Edison Liu
EW Sayers
Ho Ghang
JC Barrett
Jin Ok Yang
Jong Bhak
JS Friedlaender
JZ Li
K Zhang
LD Stein
M Hirakawa
M Jakobsson
M Kayser
P Scheet
Philip J. Shaw
S Jacobs
Shuhua Xu
Sissades Tongsima
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

The HUGO Pan-Asian SNP consortium conducted the largest survey to date of human genetic diversity among Asians by sampling 1,719 unrelated individuals among 71 populations from China, India, Indonesia, Japan, Malaysia, the Philippines, Singapore, South Korea, Taiwan, and Thailand. We have constructed a database (PanSNPdb), which contains these data and various new analyses of them. PanSNPdb is a research resource in the analysis of the population structure of Asian peoples, including linkage disequilibrium patterns, haplotype distributions, and copy number variations. Furthermore, PanSNPdb provides an interactive comparison with other SNP and CNV databases, including HapMap3, JSNP, dbSNP and DGV and thus provides a comprehensive resource of human genetic diversity. The information is accessible via a widely accepted graphical interface used in many genetic variation databases. Unrestricted access to PanSNPdb and any associated files is available at: http://www4a.biotec.or.th/PASNP

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

ScholarWorks@UNIST

ScholarBank@NUS

Mathematical Analysis of Copy Number Variation in a DNA Sample Using Digital PCR on a Nanofluidic Device

Author: A Papoulis
AJ Iafrate
AS Kapadia
B Vogelstein
DJ Sheskin
E Fieller
E Fieller
H Motulsky
HH Ropers
J Sebat
Jian Qin
JR Lupski
KK Wong
M Baer
NP Carter
R Redon
R Sindelka
Ramesh Ramakrishnan
RL Scheaffer
Simant Dube
SL Emery
SL Spurgeon
U von Luxburg
U von Luxburg
Xiaolin Wu
YM Lo
Publication venue: Public Library of Science
Publication date: 06/08/2008
Field of study

Copy Number Variations (CNVs) of regions of the human genome have been associated with multiple diseases. We present an algorithm which is mathematically sound and computationally efficient to accurately analyze CNV in a DNA sample utilizing a nanofluidic device, known as the digital array. This numerical algorithm is utilized to compute copy number variation and the associated statistical confidence interval and is based on results from probability theory and statistics. We also provide formulas which can be used as close approximations

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

ReadDepth: A Parallel R Package for Detecting Copy Number Alterations from Short Sequencing Reads

Author: AJ Iafrate
Aleksandar Milosavljevic
AM Snijders
C Alkan
C Coarfa
C Xie
Christopher A. Miller
Cristian Coarfa
D Pinkel
DR Bentley
DY Chiang
ES Venkatraman
F Mitelman
H Li
J Castle
J Wang
JL Freeman
K Inoue
L Shayesteh
M Frommer
MD Robinson
Oliver Hampton
P Cohen
R Lister
S Ahn
S Yoon
SA McCarroll
Stein Aerts
Y Ji
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Copy number alterations are important contributors to many genetic diseases, including cancer. We present the readDepth package for R, which can detect these aberrations by measuring the depth of coverage obtained by massively parallel sequencing of the genome. In addition to achieving higher accuracy than existing packages, our tool runs much faster by utilizing multi-core architectures to parallelize the processing of these large data sets. In contrast to other published methods, readDepth does not require the sequencing of a reference sample, and uses a robust statistical model that accounts for overdispersed data. It includes a method for effectively increasing the resolution obtained from low-coverage experiments by utilizing breakpoint information from paired end sequencing to do positional refinement. We also demonstrate a method for inferring copy number using reads generated by whole-genome bisulfite sequencing, thus enabling integrative study of epigenomic and copy number alterations. Finally, we apply this tool to two genomes, showing that it performs well on genomes sequenced to both low and high coverage. The readDepth package runs on Linux and MacOSX, is released under the Apache 2.0 license, and is available at http://code.google.com/p/readdepth/

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Small Deletion Variants Have Stable Breakpoints Commonly Associated with Alu Elements

Author: A Bacolla
Adam J. de Smith
AFA Smit
AJ de Smith
AJ Iafrate
AJ Sharp
Alexandra I. F. Blakemore
BE Stranger
CY Chan
D Karolchik
D Karolchik
DA Hinds
DP Locke
E Eden
E Gonzalez
E Tuzun
EV Linardopoulou
GH Perry
GJ Cost
GM Cooper
Israel Steinfeld
J Sebat
J Sebat
JA Lee
JC Barrett
JO Korbel
K Han
K Lee
KK Wong
Lachlan J. M. Coin
M Dewannieux
M Dewannieux
M Fanciulli
M Krawczak
Michael Lichten
P Scheet
PA Callinan
Philippe Froguel
PM Kim
R Chenna
R Redon
RD Wells
Rob Sladek
Robin G. Walters
S Gonzalez-Barrera
S Rozen
SA McCarroll
SK Sen
TJ Hubbard
Zohar Yakhini
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Copy number variants (CNVs) contribute significantly to human genomic variation, with over 5000 loci reported, covering more than 18% of the euchromatic human genome. Little is known, however, about the origin and stability of variants of different size and complexity. We investigated the breakpoints of 20 small, common deletions, representing a subset of those originally identified by array CGH, using Agilent microarrays, in 50 healthy French Caucasian subjects. By sequencing PCR products amplified using primers designed to span the deleted regions, we determined the exact size and genomic position of the deletions in all affected samples. For each deletion studied, all individuals carrying the deletion share identical upstream and downstream breakpoints at the sequence level, suggesting that the deletion event occurred just once and later became common in the population. This is supported by linkage disequilibrium (LD) analysis, which has revealed that most of the deletions studied are in moderate to strong LD with surrounding SNPs, and have conserved long-range haplotypes. Analysis of the sequences flanking the deletion breakpoints revealed an enrichment of microhomology at the breakpoint junctions. More significantly, we found an enrichment of Alu repeat elements, the overwhelming majority of which intersected deletion breakpoints at their poly-A tails. We found no enrichment of LINE elements or segmental duplications, in contrast to other reports. Sequence analysis revealed enrichment of a conserved motif in the sequences surrounding the deletion breakpoints, although whether this motif has any mechanistic role in the formation of some deletions has yet to be determined. Considered together with existing information on more complex inherited variant regions, and reports of de novo variants associated with autism, these data support the presence of different subgroups of CNV in the genome which may have originated through different mechanisms

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Oxford University Research Archive

University of Melbourne Institutional Repository

Brunel University Research Archive

University of Queensland eSpace